REMARKS ON THE BÖTTCHER-WENZEL INEQUALITY

Abstract. In 2005, Böttcher and Wenzel raised the conjecture that if
X, Y are real square matrices, then $\|XY - YX\|^2 \le 2\|X\|^2 \|Y\|^2$, where
$\|\cdot\|$ is the Frobenius norm. Various proofs of this conjecture were found
in the last few years by several authors. We here give another proof. This
proof is highly conceptual and requires minimal computation. We also
briefly discuss related inequalities, in particular, the classical Chern-do
Carmo-Kobayashi inequality.



1. The proof 

The Böttcher-Wenzel inequality was proved in [2] for 2 x 2 matrices and
by László [8] for 3 x 3 matrices. Different proofs of the full version of the
conjecture were obtained by Lu [9], Vong and Jin [10], Böttcher-Wenzel [3], and
Audenaert [1]. The complex matrix case was treated in Böttcher-Wenzel [3]
and Wenzel [11]. A convenient observation that links the complex case to
the real one can be found in Cheng-Vong-Wenzel [4, page 296]. A useful
observation was obtained in Audenaert [12] for further generalizations.

In this section, we give a new proof. Let $[X, Y]$ denote the commutator
of $X$ and $Y$ and let $\|\cdot\|$ stand for the Frobenius norm (= Hilbert-Schmidt
norm).

Theorem 1. Let $X, Y$ be real n x n matrices. Then

$\|[X,Y]\|^2 \le 2\|X\|^2 \cdot \|Y\|^2.$
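As a quick numerical sanity check of the statement (not part of the proof), the sketch below evaluates the gap $2\|X\|^2\|Y\|^2 - \|[X,Y]\|^2$ on random pairs and on one pair where equality holds. The helper names (`matmul`, `commutator`, `frob_sq`, `bw_gap`), the matrix size, and the sampling range are illustrative assumptions, not taken from the paper.

```python
import random

def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator(x, y):
    """Return [X, Y] = XY - YX."""
    xy, yx = matmul(x, y), matmul(y, x)
    n = len(x)
    return [[xy[i][j] - yx[i][j] for j in range(n)] for i in range(n)]

def frob_sq(a):
    """Squared Frobenius norm: the sum of the squared entries."""
    return sum(v * v for row in a for v in row)

def random_matrix(n, rng):
    """n x n matrix with entries drawn uniformly from [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(n)] for _ in range(n)]

def bw_gap(x, y):
    """2 ||X||^2 ||Y||^2 - ||[X, Y]||^2, which Theorem 1 says is >= 0."""
    return 2.0 * frob_sq(x) * frob_sq(y) - frob_sq(commutator(x, y))

rng = random.Random(0)  # fixed seed so the run is reproducible
worst = min(bw_gap(random_matrix(4, rng), random_matrix(4, rng))
            for _ in range(200))
print("smallest gap over 200 random 4x4 pairs:", worst)

# A pair attaining equality: X = E_12 and Y = E_21, so [X, Y] = diag(1, -1).
x_eq = [[0.0, 1.0], [0.0, 0.0]]
y_eq = [[0.0, 0.0], [1.0, 0.0]]
print("gap for the equality pair:", bw_gap(x_eq, y_eq))
```

Random pairs typically stay well away from equality, which is why the explicit extremal pair is included.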

In accordance with [9, Lemma 3], we make the following definition. Let
$V = \mathfrak{gl}(n, \mathbb{R})$ and define a linear map $T = T_X$ by

$T : V \to V, \quad Y \mapsto [X^T, [X, Y]],$

where $X^T$ is the transpose of $X$.
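The operator $T_X$ is useful because its quadratic form recovers the squared norm of the commutator. The short computation below is a worked sketch of that standard fact and is not spelled out in the text; it assumes the Frobenius inner product $\langle A, B \rangle = \operatorname{tr}(A^T B)$ and uses only the cyclicity of the trace.

```latex
\begin{align*}
\langle Y, T_X(Y) \rangle
  &= \operatorname{tr}\bigl(Y^T X^T [X,Y]\bigr)
     - \operatorname{tr}\bigl(Y^T [X,Y] X^T\bigr) \\
  &= \operatorname{tr}\bigl((XY)^T [X,Y]\bigr)
     - \operatorname{tr}\bigl((YX)^T [X,Y]\bigr) \\
  &= \langle XY - YX, [X,Y] \rangle = \|[X,Y]\|^2 .
\end{align*}
```

In particular $T_X$ is positive semidefinite, and maximizing $\|[X,Y]\|^2$ over unit-norm $Y$ amounts to finding its largest eigenvalue.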
Let $\Lambda = \operatorname{diag}(s_1, \dots, s_n)$ be a diagonal matrix
and assume that

(1) $s_1^2 + \cdots + s_n^2 = 1.$

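Put

$\Lambda_1 = \begin{pmatrix} 0 & \Lambda \\ 0 & 0 \end{pmatrix}.$

Let $\tilde T_{\Lambda_1}$ be the restriction of $T_{\Lambda_1}$ to $V = \mathfrak{gl}(n, \mathbb{R}) \oplus \mathfrak{gl}(n, \mathbb{R}) \subset \mathfrak{gl}(2n, \mathbb{R})$. Let
finally

(2) $A = \begin{pmatrix} C & 0 \\ 0 & B \end{pmatrix}.$

Then we have

(3) $[\Lambda_1, A] = \begin{pmatrix} 0 & \Lambda B - C\Lambda \\ 0 & 0 \end{pmatrix}$

and

$\tilde T_{\Lambda_1}(A) = \begin{pmatrix} -(\Lambda B - C\Lambda)\Lambda & 0 \\ 0 & \Lambda(\Lambda B - C\Lambda) \end{pmatrix}.$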

For the rest of the paper, we make the following generic condition: all $s_j$
are distinct and nonzero, $s_1 > s_2 > \cdots > s_n$, and all $s_i^2 + s_j^2$ are distinct.

Lemma 1. The eigenvalues of $\tilde T_{\Lambda_1}$ must either be $0$ or of the form $s_i^2 + s_j^2$.
Let $E_{ij}$ be the matrices whose only nonzero entry 1 is the $(i,j)$-th entry.
Then

(1) The eigenspace of the eigenvalue $2s_i^2$ is spanned by $(B, C) = (E_{ii}, -E_{ii})$
for $1 \le i \le n$;

(2) The eigenspace of the eigenvalue $s_i^2 + s_j^2$ for $i \ne j$ is spanned by
$(B, C) = (E_{ij}, -\frac{s_j}{s_i} E_{ij})$ and $(B, C) = (E_{ji}, -\frac{s_i}{s_j} E_{ji})$ for $1 \le i \ne j \le n$;

(3) The eigenspace of the eigenvalue $0$ is spanned by $(B, C) = (E_{ij}, \frac{s_i}{s_j} E_{ij})$
for $1 \le i, j \le n$.

In particular, the maximum eigenvalue of $\tilde T_{\Lambda_1}$ is $2s_1^2$, of multiplicity 1, and
the second largest eigenvalue of $\tilde T_{\Lambda_1}$ is $s_1^2 + s_2^2$, of multiplicity 2.

Proof. Let $A$ in (2) be an eigenvector of the eigenvalue $\lambda$ of $\tilde T_{\Lambda_1}$. Then
we have

$-(\Lambda B - C\Lambda)\Lambda = \lambda C, \quad \Lambda(\Lambda B - C\Lambda) = \lambda B.$

Assuming that $b_{ij}, c_{ij}$ are the entries of $B, C$, respectively, we have

(4) $-s_i s_j b_{ij} = (\lambda - s_j^2) c_{ij}, \quad -s_i s_j c_{ij} = (\lambda - s_i^2) b_{ij}$

for $1 \le i, j \le n$. From the above equations, we conclude that the eigenvalues
of $\tilde T_{\Lambda_1}$ must be the solutions of the equations

$s_i^2 s_j^2 b_{ij} c_{ij} = (\lambda - s_i^2)(\lambda - s_j^2) b_{ij} c_{ij}$

and hence be either $0$ or $s_i^2 + s_j^2$. Moreover, for fixed $(i, j)$ and the fixed
eigenvalue $s_i^2 + s_j^2$, we have $b_{rs} c_{rs} = 0$ except $(r, s) = (i, j)$ or $(j, i)$. Using
this observation, we find all the eigenvectors of the operator $\tilde T_{\Lambda_1}$.

□ 
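The eigenvector list in Lemma 1 can be checked numerically. The sketch below applies the block operator in the form $(B, C) \mapsto (\Lambda(\Lambda B - C\Lambda), -(\Lambda B - C\Lambda)\Lambda)$ and measures how far each listed pair is from satisfying the eigenvalue equation. The function names (`t_tilde`, `unit`, `residual`) and the sample values in `s` are assumptions made for illustration; the normalization (1) is not imposed since it does not affect the eigenvectors.

```python
def t_tilde(s, b, c):
    """Map (B, C) to (L(LB - CL), -(LB - CL)L) with L = diag(s)."""
    n = len(s)
    # Entries of M = LB - CL are s_i * b_ij - c_ij * s_j.
    m = [[s[i] * b[i][j] - c[i][j] * s[j] for j in range(n)] for i in range(n)]
    new_b = [[s[i] * m[i][j] for j in range(n)] for i in range(n)]
    new_c = [[-m[i][j] * s[j] for j in range(n)] for i in range(n)]
    return new_b, new_c

def unit(n, i, j, scale=1.0):
    """The matrix scale * E_ij as a list of rows."""
    e = [[0.0] * n for _ in range(n)]
    e[i][j] = scale
    return e

def residual(s, b, c, lam):
    """Largest entry of T(B, C) - lam * (B, C) in absolute value."""
    tb, tc = t_tilde(s, b, c)
    n = len(s)
    return max(max(abs(tb[i][j] - lam * b[i][j]), abs(tc[i][j] - lam * c[i][j]))
               for i in range(n) for j in range(n))

s = [0.8, 0.5, 0.3]  # sample values: distinct, nonzero, decreasing
n = len(s)
worst = 0.0
for i in range(n):
    for j in range(n):
        # Eigenvalue s_i^2 + s_j^2 with (B, C) = (E_ij, -(s_j / s_i) E_ij);
        # the case i == j gives eigenvalue 2 s_i^2 with (E_ii, -E_ii).
        worst = max(worst, residual(s, unit(n, i, j),
                                    unit(n, i, j, -s[j] / s[i]),
                                    s[i] ** 2 + s[j] ** 2))
        # Eigenvalue 0 with (B, C) = (E_ij, (s_i / s_j) E_ij).
        worst = max(worst, residual(s, unit(n, i, j),
                                    unit(n, i, j, s[i] / s[j]), 0.0))
print("largest eigenvector residual:", worst)
```

The loop visits $2n^2$ pairs, which matches the dimension of $\mathfrak{gl}(n,\mathbb{R}) \oplus \mathfrak{gl}(n,\mathbb{R})$, so under the generic condition the list accounts for the whole space.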

Proof of Theorem 1. Following [2], we work with the singular
value decomposition. Let $\|X\| = 1$ and let

(5) $X = Q_1 \Lambda Q_2$

be the singular value decomposition of $X$, where $Q_1, Q_2$ are orthogonal matrices
and $\Lambda$ is a diagonal matrix. Let

$B = Q_2 Y Q_2^{-1}, \quad C = Q_1^{-1} Y Q_1.$

Then we have

(6) $\|[X,Y]\|^2 = \|\Lambda B - C\Lambda\|^2.$
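Identity (6) follows by multiplying $XY - YX$ on the left by $Q_1^{-1}$ and on the right by $Q_2^{-1}$, which leaves the Frobenius norm unchanged. The sketch below checks it on a 2 x 2 example in which $X$ is built from chosen factors rather than computed by an SVD routine. The rotation angles, the diagonal entries, and the matrix `y` are arbitrary illustrative choices, and the helper names are hypothetical.

```python
import math

def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    """Transpose, which is the inverse for an orthogonal matrix."""
    return [list(col) for col in zip(*a)]

def frob_sq_diff(a, b):
    """Squared Frobenius norm of A - B."""
    return sum((a[i][j] - b[i][j]) ** 2
               for i in range(len(a)) for j in range(len(a)))

def rotation(theta):
    """2 x 2 rotation matrix, an orthogonal matrix."""
    return [[math.cos(theta), -math.sin(theta)],
            [math.sin(theta), math.cos(theta)]]

q1, q2 = rotation(0.7), rotation(-1.1)   # arbitrary orthogonal factors
lam = [[0.9, 0.0], [0.0, 0.4]]           # diagonal factor (sample values)
x = matmul(matmul(q1, lam), q2)          # X = Q1 * Lambda * Q2
y = [[0.3, -1.2], [0.5, 0.8]]

b = matmul(matmul(q2, y), transpose(q2))  # B = Q2 Y Q2^{-1}
c = matmul(matmul(transpose(q1), y), q1)  # C = Q1^{-1} Y Q1

lhs = frob_sq_diff(matmul(x, y), matmul(y, x))      # ||[X, Y]||^2
rhs = frob_sq_diff(matmul(lam, b), matmul(c, lam))  # ||Lambda B - C Lambda||^2
print("||[X,Y]||^2 =", lhs)
print("||LB - CL||^2 =", rhs)
```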

For fixed $X$, let $Y$ be a matrix with unit norm such that $\|[X, Y]\|$ is
maximized. Then we have

$T_X(Y) = \lambda' Y$

by the method of Lagrange multipliers. By [3, Proposition 5] (see also [H
Proposition 2.4]), $[X^T, Y^T]$ is also an eigenvector of $T_X$ for the eigenvalue $\lambda'$ and it is linearly
independent of $Y$. Let

$Z = \alpha Y + \beta [X^T, Y^T]$

be a linear combination of $Y$, $[X^T, Y^T]$ such that

$A = \begin{pmatrix} Q_1^{-1} Z Q_1 & 0 \\ 0 & Q_2 Z Q_2^{-1} \end{pmatrix}$

is orthogonal to the first eigenspace of $\tilde T_{\Lambda_1}$. Since the space of all such $A$ is
2-dimensional, the linear combination always exists. By (3), (6), we have

$\|[X,Y]\|^2 = \|[\Lambda_1, A]\|^2 = (A, \tilde T_{\Lambda_1}(A)).$

Using Lemma 1 we have

$\|[X,Y]\|^2 \le (s_1^2 + s_2^2)\|A\|^2 \le \|B\|^2 + \|C\|^2 = 2\|Y\|^2,$

and the theorem is proved.

□


2. Additional Remarks 

Here are some remarks on further generalizations of the Böttcher-Wenzel
inequality. We first prove the following result.

Theorem 2. Let $X, Y$ be n x n matrices. Let $X$ be a diagonal matrix and
let $\|Y\|_\infty = \max_{i \ne j}(|y_{ij}|)$, where $(y_{ij})$ are the entries of $Y$. Then we have

(7) $\|[X,Y]\|^2 \le \|X\|^2 \cdot (\|Y\|^2 + 2\|Y\|_\infty^2).$

Proof. In [9, p. 1293, Remark 1], the theorem was proved assuming that
$Y$ is symmetric. That is, for any real numbers $\lambda_1, \dots, \lambda_n$, we have

$2 \sum_{i<j} (\lambda_i - \lambda_j)^2 y_{ij}^2 \le \Bigl(\sum_j \lambda_j^2\Bigr) \cdot \Bigl(2 \sum_{i>j} y_{ij}^2 + 2 \bigl(\max_{i \ne j} |y_{ij}|\bigr)^2\Bigr).$

This implies that, for a strictly upper triangular matrix $Y_1$,

$\|[X, Y_1]\|^2 \le \|X\|^2 \cdot (\|Y_1\|^2 + \|Y_1\|_\infty^2).$

By using the same argument, the above inequality is also true for strictly
lower triangular matrices. Let $Y = Y_0 + Y_1 + Y_2$, where $Y_0, Y_1, Y_2$ are the
diagonal part, the strictly upper triangular part, and the strictly lower
triangular part of $Y$, respectively. Then we have

$\|[X,Y]\|^2 = \|[X, Y_1]\|^2 + \|[X, Y_2]\|^2$
$\le \|X\|^2 \cdot (\|Y_1\|^2 + \|Y_1\|_\infty^2 + \|Y_2\|^2 + \|Y_2\|_\infty^2)$
$\le \|X\|^2 \cdot (\|Y\|^2 + 2\|Y\|_\infty^2),$

and the theorem is proved.

□ 
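The two steps of this proof can be illustrated numerically: for diagonal $X$ the entries of $[X, Y]$ are $(x_i - x_j) y_{ij}$, so the diagonal part of $Y$ drops out and the upper and lower triangular parts contribute separately. The sketch below checks that splitting and the bound (7) on one random instance. The names `comm_sq`, `split`, and `theorem2_rhs`, the seed, and the size are illustrative assumptions.

```python
import random

def comm_sq(diag, y):
    """||[X, Y]||^2 for X = diag(diag); entries are (x_i - x_j) * y_ij."""
    n = len(diag)
    return sum(((diag[i] - diag[j]) * y[i][j]) ** 2
               for i in range(n) for j in range(n))

def split(y):
    """Strictly upper and strictly lower triangular parts of Y."""
    n = len(y)
    upper = [[y[i][j] if i < j else 0.0 for j in range(n)] for i in range(n)]
    lower = [[y[i][j] if i > j else 0.0 for j in range(n)] for i in range(n)]
    return upper, lower

def theorem2_rhs(diag, y):
    """Right-hand side of (7): ||X||^2 * (||Y||^2 + 2 * ||Y||_inf^2)."""
    n = len(diag)
    x_sq = sum(d * d for d in diag)
    y_sq = sum(v * v for row in y for v in row)
    # The max runs over off-diagonal entries only, as in Theorem 2.
    y_inf = max(abs(y[i][j]) for i in range(n) for j in range(n) if i != j)
    return x_sq * (y_sq + 2.0 * y_inf ** 2)

rng = random.Random(1)  # fixed seed so the run is reproducible
n = 5
diag = [rng.uniform(-1.0, 1.0) for _ in range(n)]
y = [[rng.uniform(-1.0, 1.0) for _ in range(n)] for _ in range(n)]

upper, lower = split(y)
total = comm_sq(diag, y)
parts = comm_sq(diag, upper) + comm_sq(diag, lower)
bound = theorem2_rhs(diag, y)
print("||[X,Y]||^2           :", total)
print("upper + lower parts   :", parts)
print("right-hand side of (7):", bound)
```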

Chern-do Carmo-Kobayashi [5] already had the Böttcher-Wenzel inequality
when one of the matrices is symmetric. Their proof actually yields the
following result.

Theorem 3. Let $X$ be a symmetric n x n matrix with $\lambda_1$ being the largest
eigenvalue and $\lambda_n$ being the smallest eigenvalue. Let $Y$ be an n x n matrix.
Then

$\|[X,Y]\|^2 \le (\lambda_1 - \lambda_n)^2 \|Y\|^2.$

Proof. Without loss of generality, we may assume that $X$ is a diagonal
matrix. Thus we have

$\|[X,Y]\|^2 = \sum_{i,j} (\lambda_i - \lambda_j)^2 y_{ij}^2 \le (\lambda_1 - \lambda_n)^2 \|Y\|^2,$

where $(y_{ij})$ are the entries of $Y$.

□ 

Let $X$ be a real n x n matrix and let $s_1, \dots, s_n$ be the singular values of
$X$. The (2, (2))-Ky Fan norm of $X$ is defined as

$\|X\|_{2,(2)} = \sqrt{s_1^2 + s_2^2}.$

Obviously, we have $\|X\|_{2,(2)} \le \|X\|$. From the proof of Theorem 1, we
actually have

(8) $\|[X,Y]\|^2 \le 2\|X\|_{2,(2)}^2 \|Y\|^2,$

which is a generalization of the Böttcher-Wenzel inequality. This strengthened
inequality was first proved by Wenzel [11].






Evidently, we have

$(\lambda_1 - \lambda_n)^2 \le 2 \max_{i \ne j} (\lambda_i^2 + \lambda_j^2).$

Therefore, if $X$ is a symmetric matrix, then the Chern-do Carmo-Kobayashi
inequality is sharper than the Wenzel inequality (8). On the other hand,
in a lot of cases inequality (7) is sharper than both the Chern-do Carmo-Kobayashi
and the Wenzel inequalities because the $\infty$-norm is usually
much smaller. We wish to obtain a common generalization of the above
three inequalities. Such a result would provide a common generalization of
the Böttcher-Wenzel inequality and the Normal Scalar Curvature inequality
proved in [9] and [7].
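To make the comparison concrete, the sketch below evaluates the three upper bounds for a diagonal (hence symmetric) $X$, where the singular values are the absolute values of the eigenvalues. The eigenvalues, the all-ones matrix $Y$, and the function name `bounds` are illustrative assumptions chosen so that the $\infty$-norm of $Y$ is small relative to its Frobenius norm.

```python
def bounds(eigs, y):
    """Commutator norm and the three upper bounds for X = diag(eigs)."""
    n = len(eigs)
    y_sq = sum(v * v for row in y for v in row)
    y_inf = max(abs(y[i][j]) for i in range(n) for j in range(n) if i != j)
    x_sq = sum(e * e for e in eigs)
    # Squares of the two largest singular values of X.
    top_two = sorted((e * e for e in eigs), reverse=True)[:2]
    return {
        "commutator": sum(((eigs[i] - eigs[j]) * y[i][j]) ** 2
                          for i in range(n) for j in range(n)),
        "theorem_2": x_sq * (y_sq + 2.0 * y_inf ** 2),       # inequality (7)
        "theorem_3": (max(eigs) - min(eigs)) ** 2 * y_sq,    # Chern-do Carmo-Kobayashi
        "wenzel": 2.0 * sum(top_two) * y_sq,                 # inequality (8)
    }

eigs = [1.0, 0.5, 0.0, -0.5, -1.0]     # sample eigenvalues of X
ones = [[1.0] * 5 for _ in range(5)]   # Y with every entry equal to 1
result = bounds(eigs, ones)
for name, value in result.items():
    print(f"{name:>10}: {value}")
```

In this instance the bound from Theorem 2 is the smallest of the three, while Theorem 3 and (8) coincide because the extreme eigenvalues are $\pm 1$.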

Finally, a generalization of Theorem 2 may exist in connection with the
Schatten p-norms considered in [^1114 fT2].

Acknowledgement. The author deeply thanks Professors A. Böttcher
and D. Wenzel for their many useful comments without which the paper
would not be in its current form.